fp16 and bfloat16 precision for dot by daineAMD · Pull Request #655 · ROCm/rocBLAS

daineAMD · 2019-08-15T17:48:50Z

Summary of proposed changes:

add fp16 precision with fp32 accumulation for dot
add bfloat16 precision with fp32 accumulation for dot

…BLAS into dotbf16

amcamd · 2019-08-20T14:15:52Z

 // Template to dispatch testing_gemm_ex for performance tests
-// When Ti == void or complex, the test is marked invalid
+// When Ti == void or complex or Ti == To == Tc == bfloat16, the test is marked invalid
 template <typename Ti, typename To = Ti, typename Tc = To, typename = void>


I do not see a test for Ti == complex. Can the comment be updated if there is no test for complex.

The gemm_ex and gemm_strided_batched_ex templates act on exclusion rather than inclusion like the other templates. The exclusion of complex types was removed when complex gemm was added. I added exclusion for bfloat16 types here as they are now permissible in type_dispatch.hpp, but not for gemm_ex. Changed the comment to reflect this in #f746ec6.

amcamd · 2019-08-20T15:27:23Z

+#############################################
+Half bfloat single double complex real: &half_bfloat_single_double_complex_real_precisions
+  - *half_precision
+  - *bfa_precision


I think bf16 is more descriptive than bfa. Is it possible to make this bf16?

Good point, changed in #f746ec6.

…BLAS into dotbf16

daineAMD · 2019-08-22T19:47:08Z

All tests (quick, pre-checkin, nightly) pass on gfx900 and gfx906.

amcamd

LGTM

leekillough

LGTM

Co-authored-by: Andrew Chapman <andrew.chapman@amd.com>

daineAMD added 4 commits August 12, 2019 12:50

Addition of rocblas_half and rocblas_bfloat16 precisions for dot.

aee1712

Cleanup for half precision dot.

5cc8992

Merge branch 'develop' of https://github.com/ROCmSoftwarePlatform/roc…

d85a603

…BLAS into dotbf16

Merge branch 'develop' of https://github.com/ROCmSoftwarePlatform/roc…

9bfa813

…BLAS into dotbf16

daineAMD requested review from amcamd and leekillough August 15, 2019 17:48

amcamd reviewed Aug 20, 2019

View reviewed changes

daineAMD added 2 commits August 20, 2019 14:39

Small cleanup.

f746ec6

Merge branch 'develop' of https://github.com/ROCmSoftwarePlatform/roc…

156b092

…BLAS into dotbf16

amcamd approved these changes Aug 22, 2019

View reviewed changes

leekillough approved these changes Aug 22, 2019

View reviewed changes

daineAMD merged commit 73a01cf into ROCm:develop Aug 22, 2019

daineAMD deleted the dotbf16 branch October 29, 2019 20:05

mlse-lib-jenkins pushed a commit that referenced this pull request Apr 26, 2021

Revert "Lwpmlse 124 opt small n dot (#636)" (#654) (#655)

860afa7

Co-authored-by: Andrew Chapman <andrew.chapman@amd.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fp16 and bfloat16 precision for dot#655

fp16 and bfloat16 precision for dot#655
daineAMD merged 6 commits into
ROCm:developfrom
daineAMD:dotbf16

daineAMD commented Aug 15, 2019

Uh oh!

amcamd Aug 20, 2019

Uh oh!

daineAMD Aug 20, 2019 •

edited

Loading

Uh oh!

amcamd Aug 20, 2019

Uh oh!

daineAMD Aug 20, 2019

Uh oh!

daineAMD commented Aug 22, 2019

Uh oh!

amcamd left a comment

Uh oh!

leekillough left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

daineAMD commented Aug 15, 2019

Uh oh!

amcamd Aug 20, 2019

Choose a reason for hiding this comment

Uh oh!

daineAMD Aug 20, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

amcamd Aug 20, 2019

Choose a reason for hiding this comment

Uh oh!

daineAMD Aug 20, 2019

Choose a reason for hiding this comment

Uh oh!

daineAMD commented Aug 22, 2019

Uh oh!

amcamd left a comment

Choose a reason for hiding this comment

Uh oh!

leekillough left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

daineAMD Aug 20, 2019 •

edited

Loading